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Abstract 

^^ . We study the problem of ranking with submodular valuations. An instance of this problem 

^N ' consists of a ground set [m], and a collection of n monotone submodular set functions /^, . . . , /", 

where each /* : 21™! — >■ R+. An additional ingredient of the input is a weight vector w € R" . The 
objective is to find a linear ordering of the ground set elements that minimizes the weighted cover 
time of the functions. The cover time of a function is the minimal number of elements in the 
prefix of the linear ordering that form a set whose corresponding function value is greater than a 
unit threshold value. 

Our main contribution is an 0(ln(l/e))-approximation algorithm for the problem, where e 

-i^ . is the smallest non-zero marginal value that any function may gain from some element. Our 

(~^ ' algorithm orders the elements using an adaptive residual updates scheme, which may be of in- 

t/3 . dependent interest. We also prove that the problem is J7(ln(l/e))-hard to approximate, unless 

P = NP. This implies that the outcome of our algorithm is optimal up to constant factors. 
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1 Introduction 

Let / : 2'"^' — )• R be a set function, where [m] = {1, 2, . . . , m}. The function / is submodular iff 

f{S) + f{T)>f{SuT) + f{SnT), 

for all S,T CI [m] . An alternative definition of subinodularity is through the property of decreasing 
marginal values. Given a function / : 2'™"' — ?• M and a set S C [m], the function fs is defined by 
fsij) = /('S'U {j}) — f{S). The value /5(j) is called the incremental marginal value of element j to 
the set S. The decreasing marginal values property requires that fsiJ) is non-increasing function of 
S for every fixed j. Formally, it requires that fs{j) > frij), for all 5 C T and j £ [m] \T. Since the 
amount of information necessary to convey an arbitrary submodular function may be exponential, 
we assume a value oracle access to the function. A value oracle for / allows us to query about the 
value of f{S) for any set S. Throughout the rest of the paper, whenever we refer to submodular 
functions, we shall also imply normalized and monotone functions. Specifically, we assume that a 
submodular function / also satisfies /(0) = and f{S) < f{T) whenever S ^T. 

In this paper, we focus our attention on the problem of ranking with submodular valuations. An 
instance of this problem consists of a ground set [m] , and a collection of n monotone submodular set 
functions /"^, . . . , /", where each /* : 2^*"^ — )• R_|_. An additional ingredient of the input is a weight 
vector w G M" . The objective is to find a linear ordering of the ground set elements that minimizes 
the weighted cover time of the functions. The cover time of a function is the minimal number of 
elements in the prefix of the linear ordering that form a set whose corresponding function value is 
greater than some predetermined threshold. More precisely, the objective is to find a linear ordering 
vr : [tti] — )• [m] that minimizes X^iLi^jCj, where Cj is the cover time of function /*, defined as the 
minimal index for which /*({7r(l), . . . ,7r(ci)}) > 1. Here, 7r(t) stands for the element scheduled at 
time t according to the linear ordering vr. It is worth noting that the fact that each cover time 
is defined with respect to a unit threshold value does not limit the generality of the problem. In 
particular, given a vector A G M" , where Aj determines the cover threshold of function /*, one can 
obtain an equivalent instance by normalizing each /* with Aj, and updating all thresholds to 1. 

1.1 Our results 

Our main contribution is an 0(ln(l/e))-approximation algorithm for ranking with submodular val- 
uations, where e = min{/^(j) > 0} is the smallest non-zero marginal value that any function may 
gain from some element. We note that elements can have a marginal value of zero. Our algorithm 
orders the ground set elements using an adaptive residual updates scheme, which iteratively selects an 
element that has a maximal marginal contribution with respect to an appropriately defined residual 
cover of the functions. This approach has similarities with the well-known multiplicative weights 
method (see, e.g., pUllT]). Our algorithm is motivated by the observation that the natural greedy 
algorithm, which iteratively selects an element based on its absolute marginal contribution to the 
cover of the functions, performs poorly. In particular, a greedy type algorithm misjudges elements 
with low marginal contribution as less important, and therefore, unwisely schedules them late. 

We also establish that ranking with submodular valuations is Q(ln(l/e))-hard to approximate, 
assuming that P ^ NP. This implies that the outcome of our algorithm is optimal up to constant 
factors. This result is attained by demonstrating that the restricted setting of our problem in which 
there is a single function to cover already incorporates the set cover problem as a special instance. 
We would like to emphasize that even though this single function setting captures the computational 
hardness of the problem, it does not capture its algorithmic essence. The main algorithmic challenge 



that is addressed by our scheme is to obtain a good performance guarantee when there are many 
functions, each of which has a different Unear order that best suits its needs. In particular, one can 
easily validate that the natural greedy algorithm is essentially optimal in the single function setting. 
One additional interesting consequence of this result is that our problem generalizes both the set 
cover problem and its min-sum variant. This is the first problem formulation that has this property. 

1.2 Applications 

Web search ranking. One impetus for studying the above problem is an application in web search 
ranking. Web search has become an important part in the daily lives of many people. Recently, 
there has been a great interest in incorporating users behavior into web search ranking. Essentially, 
these studies make an effort to personalize the web search results (see, e.g., [331 [II [El E] ) . However, 
in the absence of any explicit knowledge of user intent, one has to focus on how to produce a set 
of diversified results that properly account for the interests of the overall user population |12^ [2]. 
In particular, it seems natural to utilize logs of previous search sessions, and try to minimize the 
average effort of all users in finding the web pages that satisfy their needs. When performing web 
search, a user usually reads the result items from top to bottom J27j. The time a user spends on 
reading the result items is the overhead in web search. 

The problem of ranking with submodular valuations can model the above-mentioned scenario as 
follows: there is a set of m search result items and there are n user types of known proportion. Each 
user type has a submodular relevance function that quantifies the information that the user type 
gains from inspecting any subset of result items. The goal is to order the result items in a way that 
minimizes the average effort of the user types. The effort of a user type is the number of result items 
it has to review until it gains a critical mass of relevant information. Notice that submodularity 
suits naturally for the ranking application since information in result items overlaps and does not 
necessarily complement each other. 

Broadcast in mobile networks. Another application that can be modeled by ranking with sub- 
modular valuations is broadcast in mobile networks. In this scenario, there is a base station that 
needs to sequentially transmit a set of m data segments. In addition, there is a collection of n clients, 
each of which is interested in some individual target data. Each data segment contains a mix of in- 
formation that may be relevant to a number of clients. The amount of information depends both on 
the data segment and the client. Moreover, there is informational redundancy between different data 
segments. This allows clients to extract their relevant target data from different subsets of segments. 
A client can extract her target data once she receives sufficient relevant information from the data 
segments. The goal is to set an order for the transmission of the data segments that minimizes the 
average latency of the clients. The latency of a client is the earliest time in which she receives data 
segments that contain enough information to decode her target data. Notice that the amount of 
relevant information that each client extracts from the data segments is submodular. 

1.3 Previous w^ork on special cases 

The problem of ranking with submodular valuations extends the multiple intents ranking problem [6] . 
One can demonstrate that an input instance for the latter problem can be translated to an instance of 
ranking with submodular valuations in which each function /* is linear, and the value that the func- 
tion has for any element is either or some value i^i G (0, 1] common to that function. The multiple 
intents ranking problem is known to admit a constant approximation by the work of Bansal, Gupta 
and Krishnaswamy [7j. Specifically, they presented a clever randomized LP rounding algorithm that 



improved upon a previous logarithmic approximation 0. The min-sum set cover problem can be 
modelled as a special case of multiple intents ranking in which each fj is boolean, i.e., fj € {0, 1}. 
The best known result for this problem is a 4-approximation algorithm that was developed by Feige, 
Lovasz and Tetali [16]. This algorithm was implicit in the work of Bay-Noy et al. [9]. The former 
paper also proved that 4-approximation is best possible, unless P = NP. The minimum latency 
set cover problem is another special case of multiple intents ranking in which each function /* has 
exactly l/i^i G N4. elements with non-negative value Vi. Hassin and Levin [21] studied this problem, 
and observed that it can be modeled as a special case of the classic precedence-constrained scheduling 
problem l\prec\ '^jWjCj. The latter problem has various 2-approximation algorithms (see, e.g., the 
survey [H]). Woeginger |34j demonstrated that the special case derived from minimum latency set 
cover is as hard to approximate as the general scheduling problem. This implies, in conjunction with 
a recent work of Bansal and Khot [8] , that it is hard to approximate minimum latency set cover to 
within a factor better than 2, assuming a variant of the Unique Games Conjuncture. 

1.4 Other related v^ork 

Submodular functions arise naturally in operations research and combinatorial optimization. One of 
the most extensively studied questions is how to minimize a submodular function. A series of results 
demonstrate that this task can be performed efficiently, either by the ellipsoid algorithm [20] or 
through strongly polynomial time combinatorial algorithms [3 H 124 1 [22 1 128 1 [23 } I26j. Recently, there has 
been a surge of interest in understanding the limits of tractability of minimization problems in which 
the classic linear objective function was replaced by a submodular one (see, e.g., [32l [191 US [25]). 
Notably, these submodular problems are commonly considerably harder to approximate than their 
linear counterparts. For example, the minimum spanning tree problem, which is polynomial time 
solvable with linear cost functions is r2(n)-hard to approximate with submodular cost functions |18j . 
and the sparsest cut problem, which admits an 0(Vln n)-approximation algorithm when the cost is 
linear [3] becomes Vl{^Jn/ In n)-hard to approximate with submodular costs [32]. Our work extends 
the tools and techniques in this line of research. In particular, our results establish a computational 
separation of logarithmic order between the submodular settings and the linear setting, which admits 
a constant factor approximation [5]. 

2 An Adaptive Residual Updates Scheme 

In this section, we develop a deterministic algorithm for the problem under consideration that has an 
approximation guarantee of 0(ln(l/e)), where e = min{/^(j) > 0} is the smallest non-zero marginal 
value that any function may gain from some element. An interesting feature of our algorithm is that 
it generalizes several previous algorithmic results. For example, if our algorithm is given a multiple 
intents ranking instance then it behaves like the harmonic interpolation algorithm of [6], if it is given 
a min-sum set cover instance then it reduces to the constant approximation algorithm of [16], and 
if it is given a (submodular) set cover instance then it acts like the well-known greedy algorithm for 
the set cover problem j35j . Nonetheless, it is important to emphasize that all these algorithms use 
fixed values in their computation; in contrast, our algorithm employs dynamically changing values. 

2.1 The algorithm 

The adaptive residual updates algorithm, formally described below, works in steps. In each step, 
the algorithm extends the linear ordering with a non-selected element that maximizes the weighted 



sum of its corresponding potential values. The potential value Pij of element j for function /* is 
initially equal to the marginal value f^ij)- As the algorithm progresses, it is adaptively updated 
with respect to the selected elements and the residual cover of /*, as formally presented in line [5j 
Intuitively, this update fashion gives more influence to values corresponding to functions whose cover 
draw near their thresholds. We emphasize that this dynamic update fashion is different than that of 
the exponential weights and harmonic interpolation techniques. Also note that our adaptive residual 
updates scheme is motivated by the observation that the natural greedy algorithm, which orders 
elements based on their absolute marginal contribution, fails to provide good approximation. This 
insufficiency is exhibited in Appendix lA.il 

Algorithm 1 Adaptive Residual Updates 

Input: A collection of n submodular set functions /^, . . . , /", where each /* : 2^"^' — > M_|_ 

A weight vector w £ M" 
Output: A linear ordering vr : [m] — )• [m] of the ground set elements 
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S^0 

for t -^ 1 to m do 
for i -^ 1 to n do 

for all j G [m] \ 5 do 

^ jo ns) > 1, 

^'^ ^ \min {l, £(^yWIzp)} otherwise 
end for 
end for 

Let j G [m] \ 5 be the element with maximal Xl^^ ^«-f«i 

s^su{j} 

Tr{t) ^ j 
end for 



2.2 Analysis 

In the remainder of this section, we analyze the performance of the algorithm. There are several 
techniques that we employ. We begin by establishing an interesting algebraic inequality applicable 
for any monotone function and any arbitrary sequence of element additions. For the purpose of 
bounding the cost of the solution of our algorithm, we compare it to a collection of solutions induced 
by the optimal linear ordering applied to truncated instances of the problem. We also utilize and 
extend the analysis methods presented in |16l [6] . 

Theorem 2.1. The adaptive residual updates algorithm constructs a linear ordering whose induced 
cost is no more than 0(ln(l/e)) times the optimal one. 

Proof. We begin by introducing the notation and terminology to be used throughout this proof: 

• Let OPT and ALG be the cost induced by the optimal linear ordering and the linear ordering 
TT constructed by the algorithm, respectively. 

• Let P be the final state of the potential values matrix maintained during the algorithm. Notice 
that the potential values of each column of P are the values that the corresponding element 
had when it was selected by the algorithm. 



• Let It = {i '■ Ci > t} be the indices of the functions that were not covered before step t of the 
algorithm, and let Rt be the relative cost of step t of the algorithm. Specifically, Rt = Yla^h ^«- 
In addition, let Qt = X^"=i '^iPin{t) be the weighted sum of potential values corresponding to 
the element selected at step t of the algorithm, and Af = Y17=t' Qt- FinaHyi let Aj = Rt/Qt 
be the penalty of step t. 

We can now reinterpret the cost induced by the linear ordering vr using the mentioned notation. 
In particular, one can validate that ALG = Y17=i''^i'^i — 'YlT=i^t ~ YlT=i^tQt- "^^^ ^^^^ lemma 
bounds Aj/ in terms of Rt' ■ 

Lemma 2.2. Let 7 = ln(l/e) + 2. Then, A^/ < jRt' , for every t' G [m]. 

Proof. In what follows, we demonstrate that Yl^i ^init) — 7 holds for every function /*. Notice 
that if we establish this argument then the lemma follows since 

m n m. m 

t=t' i=l t=t' iel^i t=l iel^, i&I^i 

where the second equality results by noticing that each function /* covered before step t' must have 
Piirit) = for every t > t' . As a result, the only functions that may have strictly positive potential 
values are those that were not covered before step t', namely, those in /j/. 

Consider the function /*, and let us assume that its cover time is k. Let St = {vr(l), . . . , vr(t)} be 
the set of elements ordered up to (and including) step t according to vr. In particular, let 5*0 = 0. One 
can verify that the potential values of function /* satisfy P-n^^t) = {P{St) — /*(5't_i))/(l — /*(S't_i)) 
for every t < k, PiTr{k) = li and Pi-n-^t) = for every t > k. Consequently, we get that 

"^ fc— 1 fi/Q\ fife \ 

where the inequality follows from Claim 12. 3[ This claim establishes a generic bound which applies 
to any monotone function and any arbitrary sequence of element additions. The desired bound is 
obtained by utilizing the claim with respect to the submodular function /* and the collection of sets 
5*0, . . . , Sk-i- One should also notice that /*(5fc_i) < 1 by construction, and that e < 6. ■ 

Claim 2.3. Given a monotone function f : 2'™' — )■ ]R_|_, and a collection of set 5o ^ • • • C 5^ C [m] 
such that f{So) = and f{Si) < 1 then 

where 5 = mint{/(S'f) - f{St-i) > 0}. 

Proof. The monotonicity property of the function / guarantees that = /(jSq) < • • • < f{Si) < 1- 
We can also assume without loss of generality that f{Si/) — f{Si-i) > 6 > 0, since otherwise, the last 
term in the above-mentioned summation must be equal to 0, and therefore, may be neglected. Now, 
notice that for any t = 1, . . . ,i — 1, 



f(St-i) 1 - X JfiSt-i) 1 - /('S't-i) 1 - fiSt^i) 

5 



where the first inequahty resuhs from the fact that the function 1/(1 — re) is monotonically increasing 
for X € [0,1). Furthermore, notice that {f{Se) — /(5f_i))/(l — f{Si^i)) < 1. This simply fohows 
since we know that f{Se) < 1. Combining the previously stated arguments, we attain that 



^ /(5.) - /(S.-i) ^ y! f{St] - /(S.-i) , , , ^ ["■'''•> 1 . ^, 

g 1-/(5.-0 - S i-/(s.-.) ^'-SL-.,^'"^' 

-■/(•Sf-i) 1 / 1 



/ dx + 1 = In + 1 < ln(l/5) + 1 , 



'/(So) 

where the last inequality holds since 1 — f{Si^i) > /{Si) — f{Si^i) > 5. ■ 

We continue by introducing a collection of histograms. These histograms will be utilized to bound 
the cost of the algorithm in terms of the cost of the optimal solution. 

The optimal solution as a histogram. The histogram that relates to the optimal solution consists 
of n bars, one for each function. The bar associated with function /* has a width of Wi, while its 
height is equal to the step in which that function was covered in the optimal solution. A function 
is regarded as covered at step t if its cover time according to the optimal ordering is t. The bars in 
the histogram are ordered according to the steps in which the corresponding functions were covered 
in the optimal solution. Notice that this implies that the histogram is non-decreasing. Furthermore, 
notice that the total width of the histogram is Y17=i '^^i ^^^ ^^^ overall area beneath it is OPT. 

A collection of truncated solutions as histograms. We now define a collection of m histograms, 
each of which corresponds to a solution of a truncated input instance with respect to some step t of the 
algorithm. Informally, a truncated instance is a relaxation of the input instance that admits better 
solutions than the optimal solution. As a result, the collection of histograms establishes a connection 
between the optimal solution and the solution of the algorithm. For the purpose of defining the 
truncated input instance corresponding to step t, let S = {vr(l), . . . , 7r(f — 1)} be the set of elements 
selected by the algorithm before step t. The truncated instance is obtained by incrementally applying 
the following two modification steps to the underlying input instance: 

(i) a set of elements is given for free. We modify the instance by giving all the elements in S 
for free. The impact of this modification is two-fold: first, all the functions that were covered by 
the algorithm up to that step cannot incur any cost, and basically, they can be removed from the 
modified instance; second, the threshold of each function /* that was not covered by the algorithm up 
to that step decreases by f^{S), that is, its threshold in the modified instance becomes Aj = 1 — f^{S). 
Now, notice that in order to translate this instance to the canonical form in which all thresholds are 
equal to 1, one has to normalize each /* with the corresponding threshold Aj. It is important to note 
that the marginal values of each function /* at step t of the algorithm are normalized exactly by this 
term (to obtain the corresponding potential values). 

(ii) the cost of each function is relaxed. It is implicit in our problem definition that each function 
/* has a matching cost function pi : [0, 1] — >• M-(-, which represents the cost that /* collects per step 
with respect to its cover. Specifically, letting x indicate the "extent of cover" of function /*, its cost 
is a simple step- function, defined as follows: 

Pi{x) 




Namely, the function /* collects a cost of Wi in each step until it is covered. We modify the cost 
function of each /* to be continuously decreasing with constant derivative. In particular, the updated 
cost function of /* becomes Pi{x) = Wi ■ {1 — x). Notice that this modification implies that even a 
partial cover of a function decreases its cost per step. The interpretation one should have in mind is 
that X represents the fraction of covered weight, and once some fraction of weight is covered it stops 
incurring cost. 

The crucial observation one should make regarding the resulting truncated instance is that the lin- 
ear ordering constructed by the optimal algorithm induces a solution for this instance whose matching 
histogram is non-decreasing and completely contained within the optimal solution histogram. The 
latter argument is formally presented and proved in the following lemma. 

Lemma 2.4. The optimal linear ordering constructed with respect to the original instance induces 
a solution for the truncated instance whose matching histogram is completely contained within the 
optimal solution histogram when aligned to its lower right boundary. 

Proof. Prior to proving this lemma, it is important to note that the histogram built with respect to 
the induced solution is defined in a slightly different way than the optimal solution histogram. The 
difference results from the fact that the cost per step of each function p decreases as it is covered. 
Specifically, this truncated solution histogram has the same interpretation of the axes as the optimal 
solution histogram, and its bars are ordered according to non-decreasing heights. However, the 
number of bars depends on the underlying solution. For instance, suppose that function /* was 
incrementally covered using positive portions 5i,...,6k such that ^i^i^i = 1 in steps ti,...,tfc. 
This function gives rise to k bars in the histogram, where bar i has a width of Wi5e and a height of 
t£. Indeed, the total area beneath these bars is the relative cost of function /* since the cost per step 
of that function is Wi between steps to = and ti, Wi ■ {1 — 6i) between steps ti and t2, and so on. 
Accordingly, one can validate that 

k e-i k 



Y^(l-Y,5r)-{h-t,_^)=W^Y, 



Wi 

e=i r=l 

Now, for the purpose of establishing the lemma, we utilize the following simple claim, whose proof 
appears in Appendix lA.2[ The claim presents a transformation that may be applied to non-decreasing 
histograms and does not increase their upper boundary. 

Claim 2.5. Consider a non- decreasing histogram and suppose we modify it by decreasing the height 
of some part of a bar and then updating its x-axis position to maintain the non- decreasing property. 
The resulting histogram is completely contained within the primary histogram. 

We demonstrate that the modifications used to generate the truncated instance can be trans- 
lated into a sequence of the above-mentioned transformation that generates the truncated solution 
histogram from the optimal solution histogram: 

(i) an element is given for free. Suppose that the element given for free was scheduled at step 
i of the optimal linear ordering. Notice that all the elements scheduled at steps i' > l according 
to the optimal linear ordering are scheduled at step £' — 1 in the induced solution. This follows as 
the element under consideration does not appear in the induced linear ordering for the truncated 
instance. This implies that the cover time of all the functions that are critical with respect to 
these elements decreases by 1. We say that an element j is critical for a function if that function 



topped its threshold after element j was selected. Accordingly, the height of the corresponding bars 
in the histogram decreases by 1. This translates to a sequence of the mentioned transformation. 
Furthermore, notice that all the functions that were covered up to step i according to the optimal 
linear ordering may be covered in prior steps in the induced solution. This is due to the "free partial 
cover" that the element induces. In particular, the functions that were covered at step i of the 
optimal linear ordering must be covered in prior steps in the induced solution. The cover time of 
each of these function may vary depending on the extent of their cover with respect to the element 
under consideration and previously scheduled elements. Still, it is clear that this cover time must be 
strictly smaller than in the optimal linear ordering. Consequently, the height of the bars associated 
with these functions decreases. Again, this translates to a sequence of the mentioned transformation. 
Figure [1] provides an illustration of this modification. 
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Figure 1: The impact of giving the element scheduled at step 3 for free: (a) The initial histogram. 
Note that the numbers written inside the bars represent the new cover time of the corresponding 
functions after the element under consideration was given for free, (b) The resulting histogram. 

(ii) The cost of a function is relaxed. Suppose the cost of some function /* was relaxed, and let us 
assume that /* was incrementally covered using positive portions 5i, ... ,6k such that X^^^^ (5^ = 1 in 
steps ii, . . . , tfc of the optimal linear ordering. Notice that as a result of the relaxation, the histogram 
should consist of k bars instead of the single bar corresponding to function /*. In particular, each 
bar i should have a width of Wi5e and a height of t^. This can be interpreted as replacing a single bar 
having respective width and height of Wi and t^ with k bars whose total width is Wi X^^^^ Si = Wi 
and each has a height of at most t^. It is easy to verify that this may translate to a sequence of the 
mentioned transformation. ■ 



The solution of the algorithm as a histogram. The histogram that corresponds to the solution 
generated by the algorithm consists of nm bars, one for each entry of the potential values matrix. 
The width of each bar corresponding to entry (i, j), covered at step t of the algorithm, is its weighted 
potential value WiPtj, while its height is the penalty of the corresponding step A^. Note that an entry 
{i,j) is regarded as covered at step t of the algorithm if 7r(t) = j. The bars are ordered according to 
the steps in which the corresponding entries were covered by the algorithm. Notice that the ordering 
of an element j at step t gives rise to n bars in the histogram whose total width is Qt = X]"=i WiPij- 
Moreover, note that the total width of the histogram is Xl^i Qt, which is at least as large as (and 
maybe much larger than) the total width of the optimal histogram, and that the area beneath the 
histogram is X^^^i MQt, which is precisely ALG, as previously noted. 

Having all the histograms definitions in place, we are now ready to prove the theorem. We claim 
that the area beneath the histogram corresponding to the solution of the algorithm is at most 47 
times larger than the area beneath the histogram of the optimal solution, where 7 = ln(l/e) + 2. 



Let us consider the transformation that shrinks the width and height of each bar of the algorithm's 
histogram by a factor of 27 and 2, respectively. Specifically, after applying the transformation, the 
bar corresponding to entry (i, j) covered at step t has a width of WiPij / {2j) and a height of At/2. We 
next argue that this shrunk histogram is completely contained within the optimal solution histogram 
when aligned with its lower right boundary. Notice that this implies that the area beneath the 
shrunk histogram is no more than the area beneath the optimal solution histogram, implying that 
ALG/(47) < OPT, and therefore, proving the theorem. 

For the purpose of establishing this containment argument, let us focus on an arbitrary point 
p' in the histogram of the algorithm. We assume without loss of generality that it lies in the bar 
corresponding to entry {i',j') covered during step t' of the algorithm. This implies that the height of 
p' is at most A^ , and its distance from the right side boundary is no more than A^' . Let us consider 
the point p, which is the mapping of p' in the shrunk histogram. Note that the height of p is at most 
Ati/2, while its distance from the right boundary is at most Afi/{2-f). In the following, we prove 
that p lies within the truncated solution histogram corresponding to step t'. This is achieved by 
demonstrating that the linear ordering induced by the optimal solution for the truncated instance 
has at least At'/(27) weight to cover by time step [Af//2j. In fact, we establish a more powerful 
property that states that any ordering of elements of the truncated instance has at least Ai//(27) 
weight to cover by time step [At'/2j. Now, recall that Lemma 12.41 guarantees that the histogram of 
the truncated solution is completely contained within the optimal solution histogram, and hence, we 
obtain that p must also lie within the histogram of the optimal solution. 

Let us concentrate on the set of elements in the truncated instance corresponding to step t' . We 
argue that any element selected in any step of any linear ordering cannot reduce the weight by more 
than Qfi = ^^=iWiPij' ■ This argument results from the construction of the truncated instance, 
the submodularity of the functions, and the greedy selection rule of the algorithm. Specifically, the 
construction of the truncated instance guarantees that the (initial) marginal value of each element 
j for each function /* is equal to the potential value Pij at step t' of the algorithm. This relates to 
the normalization after the set of elements was given for free. Also note that Pij can be interpreted 
as the fraction of the weight Wi that may be covered when selecting element j. This corresponds to 
the modification of the cost function of each function in the truncated instance to be continuously 
decreasing with constant derivative. The submodularity of the functions, which involve decreasing 
marginal values, ensures that the marginal value of each element can only decrease over time, that 
is, it cannot be greater than Pij in any future step. This implies, in conjunction with the greedy 
selection rule of the algorithm, which selects the element f that maximizes the above-mentioned 
term, that any element selected in any step of any linear ordering cannot cover a weight of more 
than Qti. Consequently, any linear ordering may cover at most [A(//2j • Qti < Rt'/2 weight by step 
[A4//2J. Recall that the overall weight of the functions in the truncated instance is exactly Rti, and 
thus, at least Rt'/2 weight is left uncovered. Finally, Lemma F2. 21 guarantees that Rt'/2 > At'/{2^). ■ 

3 An Inapproximability Result 

In this section, we establish that ranking with submodular valuations is ri(ln(l/e))-hard to approx- 
imate, assuming that P 7^ NP. This implies that the outcome of the algorithm from Section [2] is 
optimal up to constant factors. The essence of the proof is by showing that our problem incorporates 
the set cover problem as a special instance. In fact, we demonstrate that even the seemingly simple 
scenario in which there is only one function to cover already generalizes the set cover problem. 



Theorem 3.1. The ranking with subniodular valuations problem cannot be approximated within a 
factor o/cln(l/e), for some constant c> 0, unless P = NP. 

Proof. An instance of a set cover problem consists of the ground set X = {1, . . . , n} and a collection 
of sets J- = {Si, . . . , Sm} ^ 2^. The objective is to find a subfamily of J- of minimum cardinality 
that covers all elements in X. Set cover is known to be NP-hard to approximate within a factor of 
O(lnn). In other words, there is a constant c > such that approximating set cover in polynomial 
time within a factor of clnn implies P = NP. This result follows by plugging the proof system of 
Raz and Safra |30j . or alternatively, Arora and Sudan [3] into a reduction of Bellare et al. [10] (see 
also the result of Feige |15j , which shows inapproximability under a slightly stronger assumption) . 

Given a set cover instance, we define an instance of ranking with subniodular valuations as follows. 
There are m elements, each corresponds to a set in J^. Furthermore, there is one submodular set 
function whose corresponding weight is 1, namely, w = (1). The submodular function is defined as 



m = l\[js. 



Notice that the resulting instance is a valid ranking with submodular valuations instance as the func- 
tion / is normalized, monotone, and submodular. For example, / satisfies the decreasing marginal 
values property since 

frij) = - |i : i G 5j and i ^ y Sij 

is non-increasing function of T for every fixed j. 

It is easy to see that a set cover in the original instance can be converted to a linear ordering 
of the elements in the newly-created instance of identical cost. Specifically, one should order the 
elements that correspond to the sets of the set cover first (in some arbitrary way), and then order 
the remaining elements. Conversely, it is not difficult to verify that given a linear ordering of the 
elements, we can perform a similar cost-preserving transformation in the opposite direction. This 
implies that unless P = NP, it is impossible to approximate the ranking with submodular valuations 
problem to within a factor of cln n > cln(l/e), where the inequality holds as e is the smallest non-zero 
marginal value which is clearly at least 1/n. ■ 

4 A Concluding Remark 

Incorporating cost into the problem. As previously noted, it is implicit in the definition of 
the problem that each function that needs to be covered has a matching step-function representing 
the cost that the function collects per step with respect to its cover. Specifically, the step-function 
corresponding to function /* is determined by its step height Wi. It is only natural to consider the 
generalization in which each /* has an arbitrary non-increasing cost function p^ : [0, 1] — t- M+ instead 
of the height parameter wi. One can demonstrate that our techniques can be utilized to solve this 
variant. The main idea is to reduce the non-increasing cost function case to the step-function case. 
This can be done by carefully approximating each non-increasing cost function by a collection of 
step- functions, as schematically described in Figure [2j The full version of the paper will provide a 
detailed description of this result. 

Acknowledgments: The authors would like to thank Oded Regev for useful discussions on topics 
related to this paper. 
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Figure 2: The reduction from an arbitrary non-increasing cost function to step-functions. 
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A Additional Details 

In this section, we present details omitted from the main part of the paper. 

A.l The natural greedy algorithm is insufficient 

Let us consider the greedy algorithm that is formally described below. This algorithm is built from 
a sequence of greedy steps that set up the linear ordering. In each step, the algorithm selects a 
non-selected element that has a maximal marginal contribution to the functions. The contribution 
of element j to each function /* is the gain that j provides towards the threshold of /*, as formally 
exhibited in line[5j 

Unfortunately, as the following theorem demonstrates, the greedy algorithm fails to provide good 
approximation. The shortfall of the algorithm is that it misjudges elements with low marginal 
contribution as less important, and thus, schedules them late. As one may expect, this turns to be 
crucial when many functions depend on a single element which has a low marginal contribution. 

Theorem A.l. The cumulative greedy algorithm has an approximation ratio of Q{n^''^). 

Proof. We consider an input instance that consists of ?7i = n^' ^ -|- 2 elements, n functions, and a 
weight vector that all its entries are identical. All the functions in the input instance are assumed 
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Algorithm 2 Cumulative Greedy 



/", where each /* : 2^' 



Input: A collection of n submodular set functions f^, . 

A weight vector w G M" 
Output: A linear ordering vr : [m] — )• [m] of the ground set elements 

1: S^$ 

2: for t -^ 1 to 77T, do 

3: for i -^ 1 to n do 

4: for all j G [m] \ 5 do 

p,^i0 f{S)>l, 

'' \mm{riSU{j})-riS),l-fiS)} otherwise 
6: end for 

7: end for 

8: Let j G [m] \ S" be the element with maximal Y17=i '^i^ij 
9: S^S\J{J] 

10: ^{t) ^ j 

11: end for 



to be linear. A function / : 2^"^^ — )■ ]R+ is called linear if there is a valuations vector v G 
that f{S) = ^jizs^j- Accordingly, we represent the functions using the following matrix. 

/ 1 



such 



n n 
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1 1 
n n 


1 































1 


/ 



v 

In particular, the i-th row represents the values that function /* has for the elements. Note that the 
size of -F's upper-left non-zero sub-matrix is {n—v}''^) x 2, while the size of its lower-right identity sub- 
matrix is n^ " X v}'"^ . Let us analyze the performance of the greedy algorithm on this instance. Notice 
that in each step, the algorithm extends the linear ordering with a non-selected element (column) 
whose sum of entries is maximal. This follows since all weights are identical, and the sum of entries of 
each row of F is exactly 1 . Consequently, the algorithm initially orders element 1 , then elements 3 to 
j^i/2_|_2^ and finally, element 2. The cost of the algorithm is {n — n^/'^)-{'n}/'^+2) + {2+. . .-l-n^/^ + l) = 
fl(n^'^). On the other hand, ordering the elements according to their column number induces a linear 



ordering whose cost is (n 



n 



1/2N 



■2 + {?, + ...+n^/'^ + 2) = 0{n). 



A.2 Proof of Claim [2751 

Consider an arbitrary point p initially positioned at coordinate (x, y) in the histogram. Notice 
that unless this point is in the area removed from the histogram due to the height decrease, it is 
transformed to a new position (x',y') as result of the x-axis position update described in the claim. 
We argue that this new position is contained in the primary histogram. If p was initially positioned 
in the bar whose height was decreased then the argument clearly holds since the corresponding bar 
is shifted while maintaining the non-decreasing property. Otherwise, the new position must satisfy 
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x' > X and y' = y. However, this implies that it must be contained in the primary histogram since 
p was initially in the histogram and this histogram is non-decreasing. Figure [3] provides a schematic 
description of the transformation. 





(a) 



(b) 



(c) 



Figure 3: The transformation described in the claim: (a) The primary histogram; (b) The result of 
the height decrease; (c) The result of the x-axis position update. 
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